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Slocin  School  of  Management,  M.I.T. 

We  consider  the  bi-criteria  scheduling  problem  of  minimizing  a  convex  combination 
of  the  mean  and  variance  of  sojourn  times  in  three  queueing  systems  fa  single  queue,  a 
two  station  closed  network,  and  a  two  station  network  with  controllable  inputs)  populated 
by  various  customer  types.  The  sum  of  the  squares  of  the  pairwise  differences  in  mean 
sojourn  times  among  the  various  customer  types  is  used  as  a  surrogate  for  the  sojourn 
time  variance.  Brovvnian  approximations  to  these  three  scheduling  problems  are  solved. 
and  the  solutions  are  interpreted  in  order  to  obtain  scheduling  policies.  Simulation  results 
show  that  in  the  network  settings,  in  contrast  to  the  single  queue  case,  there  are  priority 
sequencing  policies  that  significantly  reduce  the  variance  of  sojourn  times  relative  to  the 
first-come  first-served  policy. 

Subject  classification:  Production/scheduling:  priority  sequencing  in  a  stochastic  job 
shop.  Queues:  Brownian  models  of  network  scheduling  problems. 


November  1989 


REDUCING  THE  VARIANCE  OF  SOJOURN  TIMES 
IN  MULTICLASS  QUEUEING  SYSTEMS 

Lawrence  M.  Wein 

Sloan  School  of  Management,  M.I.T. 

The  objective  in  scheduling  most  multiclass  queueing  systems  is  to  minimize  the  mean 
sojourn  time  of  customers.  However,  in  many  practical  situations,  it  is  also  desirable 
to  keep  the  variance  of  the  sojourn  times  as  small  as  possible.  For  example,  in  many 
manufacturing  systems,  all  arriving  jobs  axe  quoted  the  same  due-date  lead  time  (due-date 
minus  arrival  date),  and  so  a  smaller  sojourn  time  variance  will  lead  to  better  due-date 
performance.  In  some  service  systems,  the  desire  to  treat  all  customers  as  equitably  as 
possible  dictates  that  a  small  sojourn  time  variance  is  required.  In  many  production 
facilities,  low  variability  in  sojourn  times  leads  to  easy  coordination  with  its  downstream 
customers  and  upstream  suppliers. 

For  single  server,  single  class  queueing  systems,  the  first-come  first-served  (FCFS) 
policy  minimizes  sojourn  time  variance  over  the  class  of  work-conserving  sequencing  policies 
(see  Kingman  [15]).  Furthermore,  Groenevelt  [4]  has  shown  that  FCFS  is  effective  in 
reducing  sojourn  time  variance  in  single  server,  multiclass  queues.  It  is  also  well  known  that 
the  shortest  expected  processing  time  (SEPT)  rule  (see  Klimov  [18].  for  example)  minimizes 
the  mean  sojourn  time  in  many  single  server,  multiclass  queueing  systems.  The  SEPT  rule, 
however,  can  lead  to  a  very  large  sojourn  time  variance;  when  the  load  on  the  system  is 
very  heavy,  customers  with  the  largest  expected  processing  times  may  sit  in  the  queue 
for  a  long  period  of  time,  while  customers  with  the  shortest  expected  processing  time  will 
usually  move  through  the  system  relatively  quickly.  Thus,  a  fundamental  tradeoff  exists  in 
single  server,  multiclass  queues  between  system  efRciency  (minimizing  mean  sojourn  time) 
and  system  equitabiiity  (minimizing  sojourn  time  variance).    As  a  result,  some  authors 
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(see  Jackson  [10]-[12],  Kleinrock  [16],  and  Shanthikumar  [22],  for  example)  have  suggested 
and  analyzed  alternative  sequencing  policies  that  attempt  to  address  this  tradeoff  in  an 
effective  manner. 

However,  very  little  is  known  about  this  tradeoff  in  a  network  setting.  Indeed,  just 
obtaining  sojourn  time  variances  in  single  class  queueing  networks  under  the  FCFS  policy 
is  an  arduous  task  (see  Chapter  4  of  Walrand  [24]  and  references  therein).  In  this  paper, 
the  tradeoff  is  assessed  by  analyzing  a  bi-criteria  scheduling  problem  for  three  multiclass 
queueing  systems.  Before  describing  the  various  criteria,  we  first  need  to  define  the  basic 
queueing  network  underlying  all  three  queueing  systems.  Using  the  terminology  of  Kelly 
[14],  we  assume  the  queueing  network  is  visited  by  various  types  of  customers,  each  with 
its  own  arbitrary  deterministic  route  through  the  system.  In  a  manufacturing  setting,  a 
customer  type  represents  a  particular  product  that  is  produced  by  the  facility.  A  different 
customer  class  will  be  defined  for  each  combination  of  type  ajid  stage  of  completion,  and 
each  customer  class  is  served  at  a  particular  single  server  station  in  the  network,  and  has 
its  own  general  service  time  distribution. 

The  first  of  the  three  queueing  systems  is  a  single  server,  multiclass  queue,  where 
each  of  the  various  types  of  customers  has  its  own  independent  renewal  input  process. 
The  second  system  is  a  closed  network,  where  the  total  population  in  the  network  is  held 
constant,  and  when  a  customer  departs  the  network,  the  type  of  the  new  customer  is  chosen 
exogenously  (in  a  Markovian  or  deterministic  fashion)  according  to  a  specified  product  mix. 
In  both  of  these  systems,  the  scheduling  decision  is  to  dynamically  decide  which  customer 
class  to  serve  next  at  each  station:  we  will  refer  to  these  decisions  as  sequencing  decisions. 
The  third  system  is  a  network  with  controllable  inputs,  where  the  scheduler  also  decides 
when  to  release  the  next  customer  into  the  network.  It  is  assumed  that  there  is  a  specified 
expected  average  throughput  rate  that  must  be  maintained,  and  there  is  an  infinite  line  of 
customers  lined  up  outside  the  network  waiting  to  gain  entrance.  The  type  of  each  entering 
customer  is  again  exogenously  specified  according  to  a  particular  product  mix. 
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Since  analyzing  the  variance  of  sojourn  times  is  so  difficult  in  a  multiclass  queueing 
network,  we  propose  a  surrogate  measure  for  the  bi-criteria  scheduling  problems.  In  par- 
ticular, define  the  surrogafe  variance  of  the  sojourn  times  as  the  sum  of  the  squares  of 
the  pairwise  differences  in  mean  sojourn  times  among  the  various  customer  types.  Thus,  if 
every  customer  type  has  the  same  mean  sojourn  time,  then  there  is  zero  surrogate  variance. 
The  first  of  the  two  criteria  in  ep.:h  of  the  three  minimization  problems  is  the  surrogate 
variance.  In  the  single  station  queue  and  the  network  with  controllable  inputs,  the  second 
criterion  is  the  mean  number  of  customers  in  the  system,  which  is  proportional  to  the 
mean  sojourn  time  in  both  systems  by  Little's  formula  (see  Little  [19]).  In  the  closed  net- 
work, the  second  criterion  is  the  mean  idleness  rate  at  an  arbitrarily  chosen  server,  which 
is  inversely  proportional  to  the  mean  throughput  rate  of  the  network.  Hence  minimizing 
the  mean  idleness  rate  of  a  server  will  minimize  the  mean  sojourn  time  of  customers.  In 
all  three  problems,  a  weight  c  will  be  put  on  the  second  criterion,  and  the  objective  is 
to  minimize  the  weighted  average  of  the  two  criteria.  When  c  =  0,  the  objective  is  to 
minimize  the  surrogate  variance,  and  when  c  —*  oc,  the  objective  is  to  minimize  the  mean 
sojourn  time. 

If  the  primary  objective  of  the  queueing  system  is  to  equalize  the  sojourn  times  across 
the  various  customer  types,  then  minimizing  the  surrogate  variance  is  the  most  appropriate 
criterion.  Moreover,  a  reduction  in  the  surrogate  variance  should  lead  to  a  reduction  in 
the  actual  sojourn  time  variance,  unless  the  reduction  in  surrogate  variance  results  in  a 
large  increase  in  the  mean  sojourn  times.  In  such  cases,  solving  the  bi-criteria  scheduling 
problem  for  various  values  of  c  should  lead  to  a  reduction  in  sojourn  time  variance. 

The  three  bi-criteria  scheduling  problems  will  be  analyzed  using  the  Brownian  network 
model  introduced  by  Harrison  [7],  which  approximates  a  multiclass  queueing  network  with 
dynamic  scheduling  capability  under  balanced  heavy  loading  conditions.  This  model  allows 
the  bi-criteria  scheduling  problems  to  be  approximated  by  control  problems  for  Brownian 
motion.  When  c  — >  cx),  the  three  scheduling  problems  described  above  have  been  amalyzed 
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in  Harrison  [7]  (single  queue),  Harrison  and  Wein  [9]  (closed  network),  and  Wein  [25]- 
[26]  (network  with  controllable  inputs)  by  solving  and  interpreting  a  reformulation  of  the 
approximating  Brownian  control  problem. 

In  this  paper,  we  generalize  these  analyses  to  include  the  bi-criteria  objective  for  any 
value  of  c.  For  the  single  queue,  we  find  in  Section  1  that  FCFS  is  effective  in  minimizing 
the  surrogate  \-ariance,  which  is  not  surprising  in  Hght  of  Groenevelt's  [4]  results.  The 
closed  network  and  network  with  controllable  inputs  are  analyzed  in  Sections  2  and  3. 
respectively,  and  a  sm:iulation  experiment  involving  a  two-station  network  is  undertaken 
in  Section  4  that  demonstrates  the  effectiveness  of  the  procedure. 

The  simulation  results  imply  that,  as  in  the  single  server  case,  the  tradeoff  between 
system  efficiency  and  system  equitability  exists  in  the  network  setting.  However,  whereas 
FCFS  minimizes  sojourn  time  variance  in  the  single  server  case,  we  find  that  it  is  easily 
outperformed  on  this  dimension  in  the  network  case.  This  should  not  be  surprising,  since 
some  customer  types  may  have  longer  routes  (and/or  processing  times)  than  other  customer 
types,  and  FCFS  will  generally  cause  the  types  with  the  longer  routes  to  have  longer  sojourn 
times. 

The  simulation  results  suggest  that  reducing  the  surrogate  \'ariance  does  indeed  seem 
to  be  an  effective  surrogate  for  reducing  sojourn  time  variance.  Also,  the  nature  of  the 
tradeoff  between  the  mean  sojourn  time  and  sojourn  time  variance  is  problem  specific,  and 
depends  on  the  magnitude  of  the  surrogate  \'ariance  under  the  policy  that  minimizes  the 
mean  sojourn  time.  If  the  surrogate  variance  is  small,  then  a  further  large  reduction  in 
sojourn  time  variance  is  not  possible.  However,  if  the  surrogate  variance  is  large,  then  a 
further  large  reduction  in  sojourn  time  variance  may  be  possible.  We  simulated  our  test 
example  under  two  sets  of  data  in  order  to  illustrate  these  two  cases.  It  is  interesting  to 
note  that  in  both  cases,  a  sequencing  policy  was  found  that  simultaneously  reduced  the 
mean  sojourn  time  and  the  sojourn  time  variance  with  respect  to  FCFS. 

In  the  simulation  study,  the  closed  network  and  network  with  controllable  inputs 


were  tested  against  an  open  network  that  used  deterministic  input.  Not  surprisingly,  the 
reduction  in  sojourn  time  variance  from  the  open  network  to  the  other  two  networks  (using 
the  FCFS  sequencing  policy  in  all  ccises)  easily  dominated  the  reduction  in  sojourn  time 
variance  among  the  various  sequencing  policies  (while  holding  the  release  policy  fixed). 

In  summary,  the  easiest  and  most  commonly  used  priority  sequencing  policy  (FCFS) 
maximizes  system  equit ability  in  single  server  queues,  and  therefore,  if  one  wants  to  max- 
imize system  efficiency  with  respect  to  FCFS,  then  one  sacrifices  some  equitability.  Our 
results  suggest  that  in  a  network  setting,  there  exists  sequencing  poUcies  that  improve  both 
system  efficiency  and  system  equitability  with  respect  to  FCFS. 

1.     A  Single  Server  Queue 

Consider  a  single  server  queue  visited  by  A'  customer  classes  who  arrive  according  to 
independent  renewal  processes  with  average  arrival  rates  \k.k  =  l....,A'.  Each  customer 
class  has  a  general  service  time  distribution  with  mean  mj.  and  finite  variance,  and  each 
customer  exits  the  system  after  receiving  service.  Therefore,  the  traffic  intensity  of  the 
queueing  system  is  p  =  ^^.-i  ^k^^k-  Let  Qk{t)  be  the  number  of  class  k  customers  in 
the  system  at  time  t,  and  let  I{t]  be  the  cumulative  amount  of  time  that  the  server  is 
idle  up  to  time  i.  The  Brownian  model  assumes  the  existence  of  a  large  integer  n  such 
that  >/n(  1  —  p)  is  positive  and  of  moderate  size;  a  representative  example  is  p  =  .9  and 
n  =  100.  The  system  parameter  n  is  used  to  define  the  rescaled  processes  Z  =  (Zk)  and  U 
by  Zk{t)  =  Qk{'n.t)/^/n  and  ['(/ )  =  I(nt)/y/n  for  /  >  0,  and  the  Brownian  control  problem 
is  obtained  by  letting  the  parameter  n  approach  infinity. 

Since  the  proposed  scheduling  policy  depends  only  on  the  solution  to  the  workload 
formulation,  we  will  go  directly  to  the  workload  formulation  of  the  Brownian  control  prob- 
lem that  approximates  this  scheduling  problem.  In  Section  6  of  Harrison  [x],  it  is  shown 
that  the  problem  (the  objective  function  will  be  stated  shortly)  is  to  choose  RCLL  (right 


continuous  with  left  limits)  processes  Z  =  [Zk)  and  U  such  that 

K 

^TnkZk(t)  =  B{i)  +  V{t)    for  all    f  >  0.  (1.1) 

Z[t]>    0   for  all  t  >  0.     and  (1.2) 

U  is  nondecreasing  with  U{0)  =  0,  (1-3) 

where  5  is  a  one-dimensional  Brownian  motion  process  with  a  particular  drift  and  co- 
variance,  and  with  B(0)  =  0.  For  all  the  problems  addressed  in  this  paper,  the  form  of 
the  scheduling  pohcy  is  independent  of  the  parameters  of  the  \-arious  Brownian  motion 
processes.  These  parameters  are  complicated  expressions  involving  the  first  and  second 
moments  of  the  interarrival  times  and  service  times,  the  product  mix,  and  the  routing 
information:  we  refer  readers  to  Harrison  [7]  for  their  derivation.  Also,  it  will  suffice  to 
assume  that  Z  and  U  are  nonanticipating  with  respect  to  the  Brownian  motion  B.  Similar 
assumptions  are  made  in  later  sections,  and  readers  are  referred  to  the  earlier  papers  in 
this  area  for  a  justification. 

By  Little's  formula,  our  objective  function  can  be  written  as 

The  first  term  in  (1.4)  is  proportional  to  the  mean  steady-state  sojourn  time,  and  the 
second  term  is  the  surrogate  sojourn  time  variance.  By  the  argmnent  in  Section  6  of 
Harrison  [7],  the  solution  to  this  problem  is  to  choose 

U{t)  =  -    inf    B{s)  (1.5) 

0<3<( 

and  to  find,  for  each  time  t,  the  solution  Z'{t)  to  the  quadratic  program 

mil 


■n^E^^o.iED^^-^r  (i.e) 


A,  A, 

=  1  j=i  k=\ 


subject  to     ^mtZi(0  =  ^nO,  (!■") 

Jt=i 
Z(0>0,  (1.8) 


where  the  one-dimensional  workload  process  W{t)  is  defined  by 

\V{t}  =  B(t)-    inf    Bis).  (1.9) 

0<j<f 

and  represents  the  (scaled)  total  amount  of  work  in  the  system.    Furthermore,  such  a 
solution  is  referred  to  as  a  pathwise  solution  to  the  workload  formulation  (1.1)-(1.4),  in 
that  it  minimizes  the  objective  for  all  times  t  with  probability  one. 
Proposition  1.     When  c  =  0,  the  solution  to  (1.6)-(1.8)  is 

Z;{t)  =  Aillill  for  k  =  1,...,A'  and   t>0.  (1.10) 

P 

Proof.  Since  the  objective  function  is  convex  for  any  value  of  c  >  0  and  the  con- 
straints are  linear,  it  follows  that  the  first-order  Karush-Kuhn- Tucker  conditions  are  suffi- 
cient conditions  for  optimaJity  (see  Avriel  [1],  for  example).  These  conditions  are  to  find 
Z;  and  TT*  that  satisfy  (1.7)-(1.S)  and 


.^'-il^-^^Z,  _^Y^h.,m,>Oi.rk  =  l,....K, 


H  ^k  ^^  Xj 


c+ —  ) nrrik 


(1.11) 


Zi  =  Ofor  ;•  =  1,....A'.  (1.12) 


Readers  may  verify  that  when  c  =  0.  a  solution  to  (1.7)-(1.S),  (l.ll)-(  1.12)  is  tt'  =  0  and 
(1.10).    I 

Thus,  in  the  Brownian  limit,  the  queue  lengths  of  the  various  classes  are  in  direct 
proportion  to  their  respective  arrival  rates  when  c  =  0.  Johnson  [13]  and  Peterson  [20] 
have  shown  that  this  same  proportionality  also  holds  under  the  FCFS  policy  in  the  heavy 
traffic  limit.  Thus,  our  analysis  suggests  that  the  FCFS  policy  is  effective  in  reducing 
sojourn  time  variance  in  single  server,  multiclass  queues.  It  is  encouraging  to  note  that 
our  analysis  concurs  with  a  related  result  using  exact  methods  (see  Groenevelt  [4]). 

In  the  limiting  case  when  c  —*  oo,  Harrison  [7]  shows  that  the  proposed  sequencing 
policy  is  the  shortest  expected  processing  time  rule,  which  again  concurs  with  exact  results 
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(see,  for  example  Klimov  [IS J.  When  0  <  c  <  oo,  we  propose  the  following  procedure  to 
obtain  an  effective  sequencing  policy.  First,  find  the  solution  {Zl{t).z')  to  (1.7)-(1.S), 
( 1.11 )-( 1.12).  This  solution  Z*  will  yield  a  unique  value  of  the  unsealed  workload  vector 
in  steady-state  by  the  conversation  law  in  Kleinrock  [17],  and  then  the  unique  parameters 
for  Kleinrock's  delay  dependent  discipline  (see  Kleinrock  [16]-[17])  can  be  derived  that 
will  achieve  this  workload  vector.  In  particular,  the  synthesis  algorithm  of  Wood  and 
Sargeant  [30]  offers  a  simple  iterative  algorithm  (where  a  quadratic  equation  is  solved  at 
each  step)  to  derive  the  necessary  parameters  (see  also  Federgruen  and  Groenevelt  [3]  for 
a  generalization  of  this  algorithm). 

2    A  Closed  Network 

For  the  networks  in  this  section  and  the  next  section,  we  will  index  single  server 

stations  by  i  =  1.2.  customer  classes  by  k  =  1 A',  and  customer  types  by  j  =  1 J. 

Since  a  different  customer  class  is  defined  for  each  combination  of  customer  type  and 
stage  of  completion,  we  have  A'  >  J.  Let  t(j)  be  the  set  of  classes  that  correspond  to  a 
particular  stage  on  the  route  of  type  _;.  The  total  population  size  of  the  closed  network 
considered  in  this  section  is  always  maintained  at  A',  and  tliis  is  achieved  by  releasing  a 
new  customer  into  the  network  whenever  one  exits.  The  new  customer  will  be  of  class  k 
with  probability  ^/t-  independent  of  all  previous  history,  where  ^jt=i  ^t  =  1:  alternatively, 
entering  customers  can  be  chosen  according  to  the  mix  Qk  in  a  deterministic  fashion,  and 
the  only  change  in  the  analysis  is  the  covariance  matrix  of  the  underlying  Brownian  motion. 
Of  course,  qk  >  0  only  for  those  classes  that  correspond  to  the  first  stage  of  some  customer 
type's  route.  We  will  also  let  qj  =  X]*:gr(i)9*'  ^°  ^^^^  entering  customers  are  of  type  j 
with  probability  q^. 

Recall  that  each  customer  type  has  its  own  arbitrary  deterministic  route  through 
the  network.    In  the  workload  formulation  of  the  Brownian  control  problem,  the  routing 


information  is  captured  in  the  quantity  A/,i,  which  is  the  expected  remaining  processing 
time  at  station  i  for  a  class  k  customer  until  that  customer  exits  the  network.  For  i  = 
1, ...,  /,  define  v,  =  ^jt=i  A/it9/t.  so  that  r,  is  the  expected  total  time  over  the  long  run  that 
server  i  devotes  to  each  newly  arriving  customer.  Define  the  relative  traffic  intensity  p,  for 
station  i  to  be  r,/ max{i'] .  tt}  for  i  =  1,2.  Then  the  balanced  heavy  loading  conditions 
for  the  closed  network  assume  the  existence  of  a  sufficiently  large  integer  A'  such  that  the 
total  population  size  is  A'  and  A'|pi  —  P2I  is  of  moderate  size;  a  representative  example  is 
A'  =  10  and  pi  =  p2  =  I. 

As  in  the  previous  section,  the  queue  length  process  and  server  idleness  process  need 
to  be  rescaled.  this  time  by  the  population  parameter  A'.  Let  Zkit)  =  Qk{N''^t)/y  for 
it  =  1,...,A'  and  t  >  0,  and  U,{t)  =  I{N^t)/N  for  z  =  1,2  and  t  >  0.  Let  5  be  a 
one-dimensional  Brownian  motion  process  with  drift  fx  and  variance  a^ ,  and  let 

.\h  =  PiMik  -  PiMik  for  k  =  l,...,K.  (2.1) 

By  Propositions  2  and  7  in  Harrison  and  Wein  [9],  the  workload  formulation  can  be  ex- 
pressed as  choosing  RCLL  processes  {l\,U2.  Zk)  that  are  nonanticipating  with  respect  to 
B  such  that 

K 

Y^  MkZkii)  =  B{t)  +  p2U,{i)  -  p:U2{t)     for  all  t  >  0,  (2.2) 

*:  =  1 
K 

J2Zk{t)  =  l     forallf>0,  (2.3) 

k  =  i 

Z;t(^)  >    0  for  A- =  1,...,A' andfor  all  t  >  0.    and  (2.4) 

Ui  and  U2  are  nondecreasing  with  t'i(O)  =  L'2(0)  =  0.  (2.5) 

If  the  number  of  customers  exiting  the  network  per  unit  of  time  is  A  over  the  long  run 
and  Zt(oo)  denotes  the  scaled  vector  queue  length  process  in  steady-state,  then  by  Little's 
formula,  the  mean  steady-state  sojourn  time  of  type  j  customers  is  -E'[^;te'-(7)  ^kioo)]/qjX. 
Even  though  the  throughput  rate  of  the  network  is  unknown,  we  can  express  our  bi-criteria 
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objective  as 

■  T     J       J 


minlim  sup  —E 
T—oo     T 


.c-.,r,../    >:>,,5ii:^-5-^,.. 


(2.6) 


In  the  limiting  case  c  — ►  ex;,  the  solution  to  this  problem  (see  Harrison  and  Wein  [9; 
for  details)  is  to  choose 

U'{t)  =  —    sup  [a' -B(s)  +  p,U2{s)]^,      and  (2.7) 

P2   0<j<( 

l-{t)  =  -   sup  [B{s)  +  p2U:{s)  -  6-]+,  (2.8) 

P\   0<s<( 

SO  that  B{t)  +  p2U\(t)  —  pil'2i'f  )■'!:  >  0,  is  a  one-dimensional  reflected  (or  regulated)  Brown- 
ian  motion  (abbreviated  by  RBM;  see  Harrison  [6]  for  a  complete  treatment)  on  the  interval 
[a',  b'].  The  interval  endpoints  are  defined  by  a*  =  mini<t</^-  Mk  and  b'  =  maxi<i..<^^-  A/jt, 
and  a*  <  0  <  b' .  Definine; 


o 


\V{t)^B(t)  +  p2U'{t)-pd'''t).  (2.9) 

and  assuming  (without  loss  of  generality)  that  the  classes  k  =  1 A'  are  ordered  so  that 

maxi<k<h-  -^h  =  AA  and  mini<t<K-  Mk  =  A/2,  then 

7(0            ifk  =  l; 
Z:{t)=  {  \-lit)     ifk==2:  (2.10) 

0  ifk  =  3 K. 


for  all  f  >  0,  where 


',(t)=—^ —     for  all    ^  >  0.  (2.ir 

0*  —  a' 


As  in  the  single  queue  case,  {Z',U')  is  a  pathwise  solution  to  problem  (2.2)-(2.6).    The 
resulting  sequencing  policy  (see  Harrison  and  Wein  [9]  for  an  interpretation  of  the  solution 

(Z*,  U'))  is  to  rank  each  customer  class  k  =  1 A'  by  the  index  A/*,  and  to  award  higher 

priority  at  station  1  (respectively,  station  2)  to  the  classes  with  smaller  (respectively,  leirger) 
values  of  this  index.  Notice  that  this  policy  awards  bottom  priority  at  each  station  to  the 
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class  that  has  a  positive  scaled  queue  length,  thus  maintaining  consistency  with  existing 
heavy  traffic  limit  theorems:  see.  for  example,  Whitt  [29],  Harrison  [5".  and  Reiman  [2'.[. 

Now  let  us  analyze  the  other  limiting  case,  when  c  =  0.    It  is  clear  that  the  set  of 
values  TV'  such  that  there  exist  Zi Zk  satisfying 

K 


W=  Y^^'hZk,  (2.12) 


K 


k=\ 

(2.13) 

Zk  >  O.for   it  =  1.. 

..,A'    and 

(2.14) 

l^kerlt 

,z, 

T^ =  0  for  z  ^  ;, 

(2.15) 

9; 

is  a  closed  interval  on  the  real  line  that  is  a  subset  of  the  interval  [a*,  6*].  Let  us  denote 
this  interval  by  [a°,6°],  where  the  superscript  reminds  us  that  we  are  analyzing  the  ceise 
c  =  0.  Therefore,  any  solution  (Z°.  1'°)  that  keeps  W{t)  within  the  interval  [a°,b°]  will  be 
a  pathwise  optimal  solution  to  (2.2)-(2.6).  There  may  be  many  pathwise  solutions,  and  we 
propose  to  choose 

U^(t)  =  —    sup  [a°  -  B{s)  +  piJ7°(5)]  +  ,      and  (2.16) 

P2  0<j<t 

r°(0  =  -   sup  [B{s)  +  p2UUs)  -  b']^,  (2.17) 

Pi     Q<3<t 

so  that  W{t)  is  a  one-dimensional  RBM  on  [a°,  6°].  Among  the  pathwise  solutions  to  (2.2)- 
(2.6),  these  values  will  maximize  the  mean  throughput  of  the  network  (see  Proposition  2 
below),  and  hence  will  minimize  the  mean  sojourn  time. 

In  order  to  find  an  effective  sequencing  policy,  we  find  solutions  Z''  and  Z^  that 
satisfy  (2.12)-(2.15)  under  the  two  extreme  points  W  =  a°  and  W  =  6°,  respectively.  All 
classes  that  have  Z^  =  Z^  =0  will  be  in  the  higher  priority  bracket  at  their  respective 
stations,  and  all  other  classes  will  be  in  the  lower  priority  bracket.  At  each  station,  all 
classes  within  the  higher  priority  bracket  are  given  priority  over  all  classes  in  the  lower 
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priority  bracket.  Within  the  higher  priority  bracket,  classes  are  ranked  in  the  same  manner 
as  when  c  — ♦  oc:  this  ranking  maintains  the  spirit  of  minimizing  mean  sojourn  time.  A 
simple  dynamic  policy  (that  will  be  illustrated  in  the  example  in  Section  4)  is  employed 
to  attempt  to  maintain  the  actual  queue  lengths  of  the  lower  bracket  classes  in  the  same 

0  1  0 

relative  quantities  as  Z"    and  Z     . 

Now  let  us  consider  the  general  bi-criteria  case  where  0  <  c  <  cc.  Notice  that  our 
solution  {U\,U2)  for  both  limiting  cases  (see  (2.6)-(2.7)  and  (2.16)-(2.17))  are  of  a  very 
special  form  (they  are  referred  to  as  control  limit  policies  in  Harrison  and  Taksar  [S])  and 
imply  that  VV{f)  in  (2.9)  is  a  RBM  on  some  closed  interval.  We  will  restrict  ourselves  to 
this  class  of  policies  (this  restriction  will  be  justified  shortly),  so  that  a  pohcy  ( Tj ,  L't  )  will 
be  characterized  by  a  particular  closed  interval  [a,fe].  It  is  clear  that  for  0  <  c  <  cxd,  the 
optimal  interval  endpoints  a  and  b  will  satisfy  a'  <  a  <  a'^  and  6°  <  6  <  6*.  where  [a',  6*] 
characterizes  the  solution  when  c  — »  oc,  and  [a°,  b^]  characterizes  the  solution  when  c  =  0. 

For  a  control  limit  policy  {L'\,U2)  characterized  by  the  interval  [a.b],  the  following 
two  propositions  are  well-known  results  (see  Chapter  5  of  Harrison  [6]). 

Proposition  2. 

lim   -E[l\{T)]  =      "=* :  •'  (2.18) 

and 

Hm  -E.[U2iT)]  =      "'  \  '  (2.19) 

T-ocT  [j[b^)  lip,=P2, 

where  v  =  2id/a^ . 

Proposition  3.  U  W(i)  is  a  RBM  on  [a,b].  then  W  has  a  uniform  steady-state 
distribution  on  [a,b]  if  pi  =  p2,  and  otherwise  has  a  truncated  exponential  steady-state 
distribution  with  density  function 

</(r-a) 

p{x)   =    -— — ^ for  a  <  X  <  6.  (2.20) 
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These  results  allow  us  to  analyze  (2.2)-(2.6)  by  considering  the  quadratic  program 


-5ti:(%^-%^)         (221. 


^    2 

m"  "      "     '         ^_'"'  ~    " 


K 

subject  to    ^A7jtZjt  =  U',  (2.22) 

K 

Y^  Zk  =  1.  and  (2.23) 

h  =  i 

Z*>0,fort  =  l A',  (2.24) 

as  a  function  of  the  righthand  side  value  \V.  We  will  abbreviate  the  objective  (2.21)  by 
\Z^CZ,  where  C  is  a  postive  semidefinite  matrix.  The  dual  of  this  quadratic  program 
(see  Dom  [2]j  is  to  choose  (Zi, ...,  Z^-,  tti,  7r2)  to 

max- -Z^CZ-H'tt,  -772  (2.25) 

subject  to    CZ  -  .U-i  -  c^ttj  >    0,  (2.26) 

where  M  =  {M\ Mk)'^  and  e  =  (1 1)-'^.    Since  the  objective  function  in  (2.21)  is 

convex  and  the  constraints  ( 2.22 )-( 2.23)  are  linear,  it  follow^  that  there  is  no  duality  gap. 
Furthermore,  the  dual  objective  function  (2.25),  which  we  denote  by  h{\V},  is  convex  with 
respect  to  \V  on  the  interval  [a*.  6'].  Also,  h{VV)  achieves  a  minimum  of  zero  on  the  interval 
[qq.  bo].  By  the  convexity  of  h(\V),  it  follows  from  Taksar  [23]  that  a  control  limit  pohcy  is 
indeed  optimal  for  (2.2)-(2.6).  Thus,  under  the  policy  characterized  by  the  interval  [a.b], 
the  value  of  the  objective  function  (2.6),  which  we  denote  by  f{a,b).  will  be 


-6  hUh^e^-'^ 


Our  solution  can  then  be  found  by  minimizing  f{a,h)  subject  to  a*  <  a  <  a°  eind 
6°  <  6  <  6*.  In  order  to  develop  a  sequencing  policy,  we  propose  to  proceed  exactly  as  in 
the  limiting  case  c  =  0,  except  use  the  interval  [a,  b]  derived  from  the  solution  to  (2.27)  in 
place  of  the  interval  [a°,6°]  derived  from  (2.12)-(2.15). 
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3.     A  Network  With  Controllable  Inputs 

The  network  considered  in  this  section  is  identical  to  the  closed  network  described  in 
the  previous  section,  except  for  the  manner  in  which  customers  axe  released  into  the  system. 
Here,  customers  are  endogenously  released  according  to  the  control  process  {N(t).t  >  0}. 
where  N{t)  is  the  number  of  customers  released  into  the  network  up  to  time  t.  There 
is  a  constraint  that  the  long  run  expected  average  number  of  customers  departing  the 
network  per  unit  of  time  is  at  least  A.  The  mix  of  customers  released  into  the  network 
is  again  exogenously  generated  according  to  the  product  mix  5  =  (<?/t)-  As  in  Section  2, 
i",  =  ^^-1  ^^Itk  for  I  =  1,2,  but  the  traffic  intensities  are  now  defined  by 

p.  =r, A  for  2  =  1,2.  (3.1) 

The  balanced  heavy  loading  conditions  for  the  network  assume  the  existence  of  a  large 
integer  n  such  that  y/n{l  —  p,)  is  positive  and  of  moderate  size  for  i  =  1.2. 
Define  the  scaled  input  process  {6{t).t  >  0}  by 

\nt  -  .\(nt) 
0{t)  =  ^ — -,  t  >  0.  (3.2) 


and  define  the  scaled  queue  length  process  Z  =  {Zk)  and  scaled  idleness  process  U  = 
{Ui,l'2)  as  in  Section  1.  The  workload  formulation  (see  Section  3  of  Wein  [26])  is  to 
choose  the  RCLL  processes  (U.Z.9)  that  are  nonanticipating  with  respect  to  B  to 


minlimsup  —E 

T—oo     ■'■ 


c[  Y^z,it)dt-,\[  y:y. 


0      /t=l  -  -^O      .=  1  ;  =  1  ^  ^'  '^J 


dt 

(3.3 


subject  to    Y^M,kZk[t)  =  B,{t)^l\{t)-v,e[t)     for  all  ^  >  0  and  e  =  1,2.  (3.4) 

limsup-£[L^(r)]<o.   for  J  =  1,2,                              .  (3.5) 
r— oc    J 

Zk{t)  >    0  for  Jt  =  1 A'  and  for  all  t  >  0,    and  (3.6) 

Ui  and  U2  are  nondecreasing  with  C'i(O)  =  t''2(0)  =  0,  (3-7) 
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where  B  is  a  two-dimensional  Brownian  motion  and  -y,  =  yjn{\  —  p,),  i  =  1.2. 

As  in  Wein  [25].  this  problem  can  be  cinalyzed  by  solving  for  the  control  process  Z  in 
terms  of  the  control  process  U .  In  particular,  we  can  solve  for  Z  by  solving,  at  each  time 
f,  the  following  quadratic  program: 

k  =  \  ~   1=1  }=\  ^  ^'  H)  / 

K 

subject  to    ^A/iZjt(0  =  -B(0  +  P2t'i(0-pi^'2(<)for  t  >  0,  (3.9) 

k-\ 
Zk{t)>QAov  k  =  \ A'.  (3.10) 

where  A/jt  is  defined  in  (2.1),  and  B{t)  =  p2Bi{t)  -  piB2(t)J  >  0,  is  a  one-dimensional 
Brownian  motion  with  drift  p  and  variance  a^ .  At  each  time  f,  this  quadratic  program  has 
different  values  for  the  right  side  of  (3.9),  which  is  again  denoted  by  \V{t).  The  objective 
function  will  be  abbreviated  by  c'^Z(t)  +  ^Z{t)^CZ[t),  where  the  matrix  C  is  positive 
semidefinite. 

The  dual  of  this  quadratic  program  is  to  choose  Z{t)  and  7r(f )  to 

max     -\z^CZ  +  \V7:  (3.11) 

subject  to    \Itt-CZ<    e^c.  (3.12) 

Once  again,  there  is  no  duality  gap,  and  the  optimal  dual  objective  function  h{\V)  is  convex 
with  respect  to  \V  and  achieves  a  minimum  of  zero  at  zero. 

Given  an  optimal  value  of  Z'{t},  the  following  constrained  singular  control  problem  is 
then  solved:  choose  nondecreasing.  RCLL,  nonanticipating  (with  respect  to  B)  processes 
L'l  and  1*2  to 

.T 


min    limsup— £ 
T — ex;     -i- 


hi\V{t))dt 


0 


(3.13) 


subject  to    \V{t)    =   B(t)  +  p2U\{t)  -  p\U2{t)  for  t  >  0,  (3.14) 
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hmsup— £x[f- i(r)]    =    .  (3.1o) 

T— oc     J-  p\  -  P2 

limsupi£.[r2(r)]    =    ILlMf,  (3.16) 

r— oo    -1  P\  -  p2 

if  B  has  drift  p  ^  0-  If  pi  =  p2.  the  5  is  driftless,  and  the  right  sides  of  (3.15)  and  (3.16) 
are  replaced  by  y/npi(l  —  pi ). 

Let  us  summarize  the  solution  procedure  to  the  workload  formulation  (3.3)-(3.7).  The 
controller  observes  a  two-dimensional  Brownian  motion  B.  from  which  can  be  observed  the 
one-dimensional  Brownian  motion  B  =  P2B1  —  pxBj-  The  solution  (Z' ,U' ,6*)  is  given  by 
the  solution  Z'  to  the  quadratic  program  (3.8)-(3.10),  which  depends  on  the  process  W . 
the  solution  V  to  the  Brownian  control  problem  (3.13)-(3.16),  and 

K 

e\t)  =  v;^\Bx{t)  +  Vl{t)-^Mxi,Zl{t)]   foralli>0.  (3.17) 

We  now  turn  to  solving  the  constrained  control  problem  (3.13)-(3.16).  In  W'ein  [25]. 
a  Langrangian  approach  was  used  to  put  the  constraints  (3.15)-(3.16)  in  the  objective 
function  (with  multipliers  r  and  /  for  (3.15)  and  (3.16),  respectively)  for  the  limiting  case 
c  — »  oc.  By  Theorem  6.2  of  Wein  [25],  the  following  conditions  are  sufficient  for  optimality 
of  (3.13)-(3.16).  Let  the  infinitesimal  generator  F  of  B  be  given  by 

\    .  (f  d 

Proposition  4.     Suppose  (y,  V'(j:),  r, /,  a,  6)  satisfy 

Mm  {TV{x]  +  h{x)-g,r  +  V'{x)J-V'{x)]  =    0.                         (3.19) 

V{0]   =   0,  (3.20) 

TV{x)  +  h{x)-  g   =   0      for  a  <  X  <  6,  (3.21) 

V'(x)    =    -r   for  J  <  a,  (3.22) 

V'{x)   =   I      for  I  >  6,  (3.23) 
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limsup  —  £j:[t'i(^)]    =    .    and 


(3.24) 


hmsup-^r  f  2li  ;j    =    ,  (3.2oj 

r-oc     J  P\  -  P2 


where 


ri(0  =  —   sup  [a-B(5)  +  piJ72(3)]  +  ,      and  (3.26) 

P2    0<ir<( 

[-2(0  =  -   sup  [B(6)  +  p2^7(^)-t]'^-  (3-2") 

Pi     0<3<( 

Suppose  V  €  C^  and  there  exisf  constaiits  A'l  and  A'2  such  that  V'(x)  <  A'l  +  Nohii). 
Then  the  optimal  pohcy  to  the  constrained  problem  (3.13)-(3.16)  is  (3.26)-(3.27). 

Following  Section  7  of  Wein  [25].  we  use  Propositions  2  and  3  in  Section  2  to  develop 
a  candidate  solution  to  (3.13)-(3.16).  which  is  characterized  by  the  interval  endpoints  a* 
and  6*.  In  particular.  Proposition  2  is  used  to  show  that  constraints  (3.15)  and  (3.16)  are 
satisfied  with  equality  if  and  only  if  the  interval  width  satisfies 

I  2v^p:(i-p.)  ifp.  =P2. 

Proposition  3  is  then  used  to  show  that  the  objective  (3.13)  is  minimized  by  solving  for  a* 
to  minimize  /(a),  where 

f(a)=Ua  ^  '     (eM^-^^-l)^^        lfpi^P2;  (329) 

I    2^p,(\-p,)  J^°+:y?,;u-„)  /^(j)^j       ifpi=p2. 

Given  the  candidate  policy  (a*.  6'),  the  obvious  cemdidate  for  the  gain  g*  that  satisfies 
the  optimality  equations  is  the  optimal  long  run  expected  average  cost  function  of  the 
Lagrangian.  or 

S-  =  //l<'-)  +  ^7f^  +  ^Jf^       'f"'^^^^  (3.30) 

l/(G')  +  (r  +  /)v/^pi(l-pi)     \ip\=P2- 

The  potential  function  V''(x)  and  the  multipliers  r*  and  /'  that  satisfy  the  optimality 
equations  can  then  be  found  as  in  Section  8  of  Wein  [25].  A  closed  form  solution  {Ul,  V^) 
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to  the  constrained  problem  (3.13)-(3.16)  and  a  proof  of  optimality  is  given  in  [25]  for  the 
Hmitinp  case  c  —*  :>z.  The  solution  to  f3.29)  is  more  difncuh  in  our  general  case  v.-here 
0  <  c  <  oc  (a  numerical  example  is  carried  out  in  the  next  section),  and  the  verificacion 
of  optimality  needs  to  be  done  on  a  case-by-case  basis. 

Now  let  us  interpret  the  optimal  solution  to  the  workload  formulation  in  terms  of 
the  original  queueing  system.  The  solution  {Z*{t).n'{t))  to  the  dual  quadratic  program 
(3.11)-(3.12)  yields  the  dynamic  reduced  costs 

Ck{t)  =  c  +  {CZ')k{t)-  MkT^'{t}   for    k  =  l A',  and  <  >  0,  (3.31) 

where  {CZ')k(t)  denotes  the  ^-th  element  of  CZ'{t).  This  v-alue  measures  the  increase  in 
the  objective  function  of  problem  (3.3)-(3.7)  per  unit  of  increase  in  the  righthand  side  value 
of  the  nonnegativity  constraint  Zk(f)  >  0.  Thus,  the  higher  the  value  of  Ck{t).  the  more 
costly  it  is  to  hold  a  class  k  customer  in  queue  at  time  t.  As  in  Wein  [26].  we  propose  the 
policy  that  awards  higher  priority  at  time  t  to  the  classes  wi'h  the  higher  dynamic  reduced 
costs  Ck{t).  If  more  than  one  class  at  the  same  station  have  a  dynamic  reduced  cost  equal 
to  zero,  then  we  use  the  tie-breaking  rule  proposed  in  Yang  ^31]  and  also  described  in  Wein 
[27]. 

The  proposed  input  policy  is  to  release  a  customer  into  the  network  whenever  the 
two-dimensional  workload  process  enters  a  specific  region  in  the  nonnegative  orthant  of 
R^ .  This  region  is  derived  from  the  interval  endpoints  [a'.b']  and  from  the  solution  Z'  of 
the  quadratic  program.  The  policy  is  most  easily  described  with  a  concrete  example,  and 
so  we  defer  its  description  until  the  next  section. 

4.     An  Example 

We  will  illustrate  the  procedure  described  in  Sections  2  and  3  with  the  simple  example 
displayed  in  Figure  1,  which  %s  .^  first  studied  in  Harrison  and  Wein  [9]  and  Wein  [26].  There 
are  two  customer  types,  .4  and  B,  and  type  A  customers  have  two  stages  on  their  route, 
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while  type  B  customers  have  four  stages.   Thus,  there  are  six  customer  classes,  and  they 

will  be  designated  (and  ordered  from  k  =  1 6)  by  .41.  .42.  SI.  B2.  S3,  and  B-i.    For 

concreteness,  all  processing  times  are  assumed  to  be  exponential,  and  the  mean  processing 
times  for  all  six  classes  are  displayed  in  Figure  1.  The  specified  product  mix  is  q  = 
(1/2.0, 1/2,0,0,0),  and  customers  are  released  into  the  network  in  the  order  ABABAB... 


A 
B 


4 


8 


STATION 
1 

A2 

1 

B2 

p 
6 

2^ 

^B3 

B4 

7 

STATION 
2 


Exn 


EX! 


Figure  1.    An  example. 


The  workload  profile  matrix  A/  =  (A/.t)  is  given  by 

,,       ,4     0     10      2      2     0 

A/  = 


1     1     13     13     7     7  '  '  ^^-^^ 

where  A/,^:  is  the  expected  remaining  processing  time  at  station  i  for  a  class  k  customer  until 
that  customer  exits.  .A.lso,  the  drift  \i  and  variance  cr^  of  the  one-dimensional  Brownian 
motion  B  was  calculated  in  Wein  [26]  to  be  zero  and  10.93.  respectively. 

For  the  closed  network  problem  of  Section  2,  we  have  I'l  =  r2  =  7  so  that  p^  —  p^  =■  1. 
Therefore,  when  c  — +  cc,  the  proposed  sequencing  policy  ranks  each  customer  class  by  the 
index 

A/^  =  (3   -1-3-11-5-7)        for   k  =  \ 6,  (4.2) 

and  gives  priority  (from  highest  to  lowest)  in  the  order  (B3,  Bl,  Al)  at  station  1  and  (A2, 
B4,  B2)  at  station  2. 
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Turning  now  to  the  limiting  case  c  =  0,  it  can  be  calculated  that  the  interval  [a°.  6°]  sat- 
isfying (2.12)-(2.15)  is  [-6.0],  which  is  strictly  contained  in  the  interval  [a'.b']  =  [-11. 3\ 
which  is  calculated  from  (4.2).  When  U'  =  -6.  the  solution  is  Z""  =  (0.1/2.0.1/2.0.0) 
and  when  \V  —  0.  the  solution  is  Z*  =  (1/2,0,1/2,0,0,0).  Thus  we  give  top  priority  to 
class  B3  at  station  1  and  class  Bi  at  station  2,  since  these  classes  are  in  the  higher  priority 
bracket. 

In  order  to  decide  how  to  award  priorities  among  the  classes  that  are  in  the  lower 
priority  bracket  at  each  station,  a  simple  dynamic  scheme  is  used  that  attempts  to  keep 
the  number  of  customers  in  queue  of  each  class  in  the  relative  proportions  dictated  by 
Z"  and  Z^  .  For  simphcity.  we  will  describe  this  heuristic  in  the  case  where  there  are 
two  classses.  call  them  classes  1  and  2,  in  the  lower  priority  bracket  of  a  given  station. 
Let  Z]  and  Z2  be  the  positive  values  of  these  classes  from  the  solution  Z"  and  Z*"  .  The 
idea  behind  the  heuristic  is  to  keep  the  proportion  of  class  1  customers  in  queue  to  be 
Zi/(Zi  -f  Zo).  Suppose  at  time  t.  there  are  Q\(t)  >  0  class  1  customers  and  (5:(0  >  0 
class  2  customers  in  queue,  and  the  server  has  just  completed  a  service  and  is  ready  to 
begin  serving  a  new  customer.  If  the  server  serves  a  class  1  customer  with  probability 
p{t)  and  a  class  2  customer  with  probabiUty  1  —  p(t),  then,  after  the  choice  is  made,  the 
expected  proportion  of  total  jobs  in  queue  that  are  of  class  1  is 

p(f)lQi{t)-l)  +  il-p{t])Qiit) 


Qi(t)  +  Q2{t)-l 
Setting  this  equal  to  the  desired  proportion  Zi/(Zi  -f  Z2)  and  solving  for  p{t)  yields 


(4.3) 


p{i)  =  - '     '  ■  (4.4) 

Z,i  -\-  Z2 

The  dynamic  heuristic  is  to  serve  class  1  customers  at  time  i  if  p(/)  >  1,  serve  class  2 
customers  if  p{t)  <  0,  and  serve  class  1  customer  with  probabihty  p{t)  if  p{t)  G  (0, 1). 

Returning  to  our  example,  from  the  solution  Z"     (respectively,  Z     ),  it  is  desirable 
to  keep  the  queue  lengths  of  classes  ,42  and  B2  (respectively,  .41  and  Bl]  at  station  2 
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(respectively,  station  1)  as  equal  as  possible,  and  thus  our  dynamic  heuristic  described  in 
■;4.4'i  takes  on  a  particularly  simple  form.  When  no  class  B3  customers  are  available  for 
processing  at  station  1,  we  serve  class  .41  customers  when  Qiit)  >  Qiit)  and  serve  class 
Bl  customers  when  Qzit)  >  Qi(t).  and  alternate  (or  flip  a  fair  coin)  when  there  is  a  tie. 
Similarly,  when  there  are  no  class  54  customers  in  queue  at  station  2,  we  serve  class  .42 
customers  when  QafO  >  Qii^)  ^'^^  serve  52  class  customers  when  Qiit)  >  Q2{t)- 

For  the  general  case  where  0  <  c  <  oo,  the  optimal  dual  objective  function  h{\V)  in 
(2.25)  is  given  by 

r(6±H:)2    if  iVe  [-11.-6]; 

h{\V)=  I  0  if  Vi'6  [-6,0];  (4.5) 

[^  ifU'€[0,3], 

and  thus  f{a.b}  is  given  by 

,      ,  1      /ca2       63       q3       jgQ2       ;^08a      216n  ,,^^ 

/(a, 6)  =  - (_  +  _-_-_ _  _  —    .  (4.6) 

0  —  a  \   2         2t        (0  lO  to  to  / 

Therefore,  our  optimal  interval  endpoints  can  be  found  by  minimizing  f{a.b)  over  a  € 
[-11,-6]  and  be  [0,3]. 

Now  let  us  consider  the  numerical  example  in  the  context  of  a  network  with  con- 
trollable inputs.  If  we  choose  a  long  run  average  throughput  rate  of  .1286  customers  per 
unit  of  time  and  choose  the  scaling  parameter  n  =  100,  then  p\  =  P2  =  -9.  The  solution 
{Z'[t).-*{t))  to  the  dual  quadratic  program  (3.11)-(3.12)  is  given  in  Table  I,  and  the  dy- 
namic reduced  costs  (3.31)  are  given  in  Table  II.  In  each  of  these  tables,  the  solution  and 
costs  have  been  broken  down  into  three  regions  (corresponding  to  the  columns  in  the  two 
tables),  depending  upon  the  value  of  the  workload  imbalance  process  Vr(f).  The  optimal 

dual  objective  function  h{\V)  in  (3.11)  is  given  by 

r_i^_£»:     if  Tr  < -55c/12; 
h(\V)  =  I  -4  +  ft^      if  W  e  [-55C/12.0];  (4.7) 

[cHl  +  ivl         if^V'>0. 

By  (3.28),  it  follows  that  b'  -  a'  =  6.072,  which  we  will  denote  by  k.  Solving  (3.29)  yields 

^^^^.Ic-k+^lc'  +  lck     if  ce  [0,6^35];  ^^  g^ 

-^k  iic>6k/3o. 
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\'ARIABLE  \V(t)<-5oc/12  \V(t)  e  [-boc;i2.0]      \V(t!>0 


Z'(t] 


\V(t) 


z;{t)  0  0  0 

^Ay^)  12      "^144  11  ^ 

^5(0  0  0  0 

Z;{t)  0  0  0 

"i'j  6  12111  3''"9 

TABLE  I.    Solution  to  the  Dual  Quadratic  Program  (3.11)-(3.12). 

In  order  to  interpret  the  solution  to  the  workload  formulation  given  in  Table  I  and 
(4.Sj.  let  us  consider  the  limiting  example  c  — •  CC'  analyzed  in  Wein  [26];  readers  may 
verify  that  the  solution  derived  here  in  the  limiting  case  is  identical  to  the  solution  found 
in  Section  7  of  Wein  [26].  In  this  case,  there  are  only  two  regions  (i.e.,  two  columns) 
to  consider  in  Tables  I  and  II,  and  it  is  easily  seen  that  the  sequencing  policy  based  on 
the  dynamic  reduced  costs  in  Table  II  ranks  all  classes  by  the  index  M^  in  (4.2),  and 
serves  the  class  with  the  smallest  (respectively,  largest)  value  of  the  index  when  \V{t)  >  0 
(respectively.  \V(t)  <  0).  From  Table  1,  we  see  that  only  one  component  of  Z'  is  positive  at 
any  time  t  and  only  two  components  are  ever  positive.  Thus  the  two-dimensional  workload 
process  defined  by  W,(t)  =  Ylk-i  -^UkZkif)  for  ?  =  1,2  and  t  >  0,  stays  on  the  boundary 
of  a  cone  in  the  nonnegative  orthant  of  R'^.  Furthermore,  the  interval  [a*,  6*],  which  equals 
[-4.771, 1.301]  by  (4.8),  determines  cutoff  points  on  the  cone  boundary  beyond  which  the 
workload  process  may  not  enter  (see  Figure  2).  As  explained  in  Wein  [26],  the  control 
process  9{t),  which  can  move  either  way  along  the  45  degree  direction  in  Figure  2  (see 
(3.4)),  is  used  to  keep  the  workload  process  on  the  truncated  cone  boundary  in  Figure  2. 
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Thus,  when  the  workload  process  is  in  the  region  to  the  lower  left  of  the  truncated  cone. 

then  exertiiig  c'  corresponds  to  releasing  more  jobs  into  the  system  relative  to  the  nonii:.;;! 
input  rate  A. 

CLASS  \V(f)  <  -ooc/U  \V{t)  e  [-55C/12.0]      W{t)  >  0 


.41 


2c 
3 


3      '  9 


Bl  ^  ^  -  -^^^^  2c 


B2  0  0  i|^  +  iililll 

DO                                                    _  tc  i^>t  I  ')  8c     I     4Vr(0 

^•^                                                  C  1  1  1  Ti  3      '"         9 

2c  4c  81V'(()  lOc     1     BVV(t) 


11 

'    121 

10c 
11 

,  24ir(o 
"^   121 

8c 
11 

16lV(() 
121 

0 

6c 

11 

121V(0 

121 

l£ 

81V'(() 

4  c     ,     B\V{t) 


3  11121  3"^  9 

TABLE  n.    Dynamic  Reduced  Costs. 

The  resulting  release  policy  is  called  a  workload  residating  release  policy,  and  it 
releases  a  job  into  the  network  whenever  the  unsealed  workload  process  u'{t),  where 
W{t)  =  u'(nt)/ y/n,  enters  a  specific  region.  For  our  example,  this  region,  which  is  shaded 
in  Figure  2,  is 

u'i(0<19     and  (4.9) 

"■2(0-7U':(f)<-6,  (4.10) 

4  4 

or 

u'2(0<62     and  (4.11) 

iri{t)-^W2{t)<^e,  (4.12) 

where  the  parameter  e,  which  shifts  the  cone  vertex  from  the  origin  to  the  point  (e.  e),  is 
chosen  so  that  the  desired  throughput  rate  is  met.    Choosing  e  =  1  achieved  the  target 
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throughput  rate  in  the  simulation  run  that  appears  later  in  this  section.  A  more  detailed 
explanation  that  interprets  the  solution  (Z' ,U' ,0')  of  the  workload  formulation  (3.3)- 
(3.7j  in  order  to  obtain  the  workload  regulating  release  policy  denned  by  (4.9;-(4.12j  can 
be  found  in  Section  6  of  Wein  [26]. 


W 


2 


w 


Figure  2.    The  release  region  when  c  — >  ex:. 


For  the  case  when  0  <  c  <  oo,  the  sequencing  policy  can  easily  be  calculated  from 
the  dynamic  reduced  costs  in  Table  11,  but  the  determination  of  the  release  policy  is  more 
complicated.    In  the  limiting  case  c  =  0,  equation  (4.8)  implies  that  [a',  6']  =  [—6.072,0] 
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and  thus  W(t)  will  always  be  in  the  leftmost  region  in  Table  I.  From  this  table,  it  follows 
that  Z'(i)  =  Zl{t^  =  -\\'lf  ]/l2.  and  so  the  workload  process  stays  on  a  segment  of  the 
line  ~\V\it)  =  U2(f  j.  The  release  policy  in  this  case  (see  Wein  [26]  for  an  explanation)  is 
to  release  a  job  whenever 


u'2(0  <  "1     and    7u'i(0  -  u'2(0  <  6e, 


(4.13) 


which  is  pictured  in  Figure  3.   Once  again,  e  =  1  achieved  the  desired  throughput  rate  in 
the  simulation  experiment. 


W 


2 


w 


Figure  3.    The  release  region  when  c  =  0. 


When  0  <  c  <  oo,  then  the  release  region  will  be  as  pictured  in  Figure  4.  The  slope 
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of  the  lower  ray  is  1/4.  just  as  it  is  in  Figure  2.  The  upper  shaded  region  in  Figure  4  is 
formed  by  a  line  that  ha?  a  slope  of  13/2  (the  same  as  the  upper  ray  in  Figure  2)  and 
another  line  that  has  a  slope  of  7  (which  is  parallel  to  the  upper  ray  in  Figure  3.)  The 
intersection  of  these  two  lines  is  at  VVit)  =  — 56c/12,  which  is  the  cutoff  point  between  two 
of  the  regions  in  Table  I. 


W 


2 


Slope  =  7 

Slope  =  13/2 
Slope  =  1  /4 


w 


Figure  4.    The  release  region  when  0  <  c  <  oc. 


Before  turning  to  the  simulation  results,  there  is  one  refinement  that  has  been  made 
to  the  scheduling  policies  described  earlier.  Notice  in  Table  II  that  many  of  the  scaled 
queue  length  processes  are  zero.     Due  to  the  rescaling,  Z  measures  how  many  tens  of 

26 


customers  are  in  the  actual  network.  This  measurement  is  obviously  too  crude  to  account 
for  the  customers  who  are  in  service,  arid  so  we  propose  a  slirht  refinement  to  the  surrogate 
sojourn  time  \'ariance  that  essentially  reinterprets  Z  as  only  the  scaled  number  of  customers 
in  queue,  not  including  the  customers  in  service.  This  refinement  will  hopefully  cause  a 
reduction  in  the  surrogate  sojourn  time  varicuice,  ajid  it  will  be  described  in  terms  of  the 
numerical  example.  For  this  numerical  example,  each  server  is  to  be  utilized  about  909c 
of  the  time,  and  we  will  denote  this  by  p  —  .9.  From  Figure  1,  the  mean  number  of  type 
A  customers  in  service  is  5p/14  and  the  mean  number  of  type  B  customers  in  service  is 
23/?/14.  The  refinement  changes  the  surrogate  sojourn  variance  from 

2 


Z,(0  +  Z.(t)       Z^it)  +  Z,ii)  +  Z,(t)  +  Zeit] 


V         1/2  1/: 

in  (2.21)  and  (3.S)  to 


Z,(i)  +  Z2(t)  +  ^       Z^{i)  +  Zi{t)  +  Z5{t)^Zeit)^^ 


(4.14) 


2 


V  1/2  1/2 

in  (2.21)  and 

Ziit)  +  Z2(t)  +  jf^       Z,it)  +  Z,{t)  +  Zs{t)  +  Ze{t)  +  -^ 


(4.15) 


(4.16) 


1/2  1/2 

in  (3.S).  In  our  simulation  experiment,  this  refinement  was  only  analyzed  for  the  case  c  =  0 
in  the  closed  network.  This  is  a  particularly  easy  ccise  to  evaluate,  since  the  only  difference 
in  the  analysis  is  to  change  equation  (2.15)  from 

Z,  +  Z2  -  Z3  -  Z4  -  Zs  -  Ze  =  0  (4,17) 

to 

Zi  +  Z2  -  Z3  -  Z4  -  Z5  -  Ze  =  ^.  (4.18) 

A  simulation  experiment  was  performed  to  assess  the  effectiveness  of  the  analysis 
presented  in  Sections  2  and  3.    Two  conventional  policies,  which  consist  of  a  customer 
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release  policy  and  a  priority  sequencing  policy,  were  tested.  One  policy  was  deterministic 
input  (abbreviated  by  DET  in  Table  III),  where  customers  are  released  at  constant  inter^.-al- 
in  the  order  ABABAB....  paired  with  FCFS  sequencing,  and  the  second  was  closed  loop 
input  (CL).  where  the  total  population  level  was  held  constant,  and  FCFS.  The  closed 
release  policy  was  also  tested  in  conjunction  with  the  sequencing  policies  derived  in  Section 
2  under  the  two  limiting  cases,  c  —*  oc  and  c  =  0.  The  c  =  0  policy  used  the  refinement 
described  in  equation  (4. IS).  The  job  release  and  priority  sequencing  policies  derived  in 
Section  3,  which  will  be  denoted  by  \VR  (for  workload  regulating),  were  also  tested  under 
the  two  limiting  cases. 

In  order  to  allow  for  a  comparison  of  the  various  cases,  we  have  set  the  parameters  .V 
and  e  so  that  each  policy  achieves  the  same  average  throughput  rate,  which  for  convenience 
was  chosen  to  be  .127  customers  per  unit  of  time,  which  corresponds  to  a  server  utilization 
of  SS.97t.  Due  to  the  discrete  nature  of  the  population  parameter  .V,  we  were  not  always 
able  to  obtain  this  target  rate  e.xactly.  In  such  cases,  we  havo  reported  hnear  interpolations 
of  the  various  performance  measures  so  that  the  average  throughput  rate  would  be  .127. 
For  each  policy  tested.  20  independent  runs  were  made,  earh  consisting  of  5000  customer 
completions  axid  no  initialization  period.  The  mean  sojourn  time,  the  actual  sojourn  time 
standard  deviation  (as  opposed  to  the  surrogate  sojourn  time  standard  deviation),  and  the 
mean  throughput  rate,  along  with  959c  confidence  intervals  are  recorded  in  Table  III. 

Referring  to  the  results  in  Table  III,  it  is  seen  that  the  (DET, FCFS)  policy  achieved 
a  sojourn  time  standard  deviation  that  was  twice  as  high  as  most  of  the  other  policies, 
implying  that  job  release  has  a  large  impact  on  the  sojourn  time  variance.  Of  course,  in 
many  manufacturing  systems,  the  amount  of  time  a  job  spends  waiting  to  gain  entrance 
onto  the  shop  floor  is  also  of  importance,  and  that  time  is  ignored  in  this  study;  readers 
are  referred  to  Wein  and  Chevalier  [2S]  for  a  scheduling  study  where  this  time  is  taken  into 
account.  Minimizing  the  surrogate  sojourn  time  variance  appears  to  be  am  effective  means 
of  reducing  the  actual  sojourn  time  variance,  since  the  two  c  =  0  cases  reduced  the  sojourn 
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time  variance  relative  to  the  corresponding  c  — »  oo  cases.  The  (CL.c  =  0)  case  achieved 
the  lowest  sojourn  time  variance,  perhaps  in  part  because  of  the  refinement  described  in 
(4.18). 


SCHEDULING 

MEAN- 

VARIANCE  OF 

MEAN 

POLICY 

SOJOURN  TIME 

SOJOURN  TIME 

THROUGHPUT 

DET,FCFS 

90.6  (±5.0) 

61.3  (±3.83) 

.127  (±.0000) 

CL,FCFS 

71.7  (±.45) 

31.6  (±.54) 

.127  (±.0008) 

CL,c  — »  oc 

50.4  (±.31) 

27.9  (±.27) 

.127  (±.0009) 

CL.c  =  0 

6S.9  (±.43) 

25.4  (±.26) 

.127(±.000S) 

WR.c  ^  oo 

39.1  (±.52) 

33.0  (±.53) 

.127  (±.0009) 

WR.c  =  0 

53.3  (±1.45) 

29.5  (±.71) 

.127  (±.0008) 

TABLE  III.    Simulation  Results  for  Example  1. 


For  this  particular  example,  the  difference  in  mean  sojourn  times  between  the  c  =  0 
and  c  — >  oc  cases  is  much  larger  than  the  corresponding  difference  in  sojourn  time  variance. 
Thus,  relative  to  the  c  — >  do  case,  the  queueing  system  would  have  to  incur  a  large 
increase  in  mean  sojourn  time  in  order  to  achieve  a  relatively  small  decrease  in  sojourn  time 
standard  deviation.  However,  this  phenomena  is  problem  specific,  and  can  be  explained  by 
analyzing  the  sequencing  policy  for  the  (CL,c  — +  oo)  case  given  in  (4.2).  and  by  recalling  the 
following  fact  about  priority  queueing  systems.  When  a  queueing  system  is  heavy  loaded 
and  static  priorities  are  used,  then  the  bottom  priority  class  at  each  server  incurs  much 
more  delay  in  queue  than  the  other  classes;  as  mentioned  earUer,  this  has  been  quantified 
in  several  heavy  traffic  limit  theorems.  Since  product  -4  gets  bottom  priority  at  station  1 
and  product  B  gets  bottom  priority  at  station  2  under  the  (CL,c  — ♦  oc)  case,  the  surrogate 
sojourn  time  variance  for  this  policy  was  not  that  large,  and  weis  less  than  it  was  under 
the  (CL,FCFS)  case.   Hence,  for  this  particular  example,  the  (CL,c  — »  oo)  case  provides 
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a  simultaneous  reduction  in  the  mean  and  variance  of  the  sojourn  times.   Furthermore,  a 
large  reduction  in  sojourn  time  variance  relative  to  the  (CL.c  — ►  oo)  case  is  not  possible. 

On  the  other  hand,  consider  our  same  example,  but  now  change  the  mean  processing 
times  for  the  six  classes  from  (4,  1.  8.  6,  2,  7)  to  (2,  6,  4,  1,  8,  7);  we  will  denote  the  example 
using  this  new  data  set  as  example  2.  Now  the  (CL,c  — »  oo)  policy  awards  lowest  priority 
to  type  B  products  at  both  stations,  and  so  the  surrogate  variance  is  now  very  large  for 
this  policy,  and  is  larger  than  the  (CL,FCFS)  case.  Therefore,  we  might  expect  that  the 
difference  in  sojourn  time  variances  between  the  two  cases  (CL,c  — *  oc)  and  (CL,c  =  0) 
will  be  larger  than  for  our  original  example. 

SCHEDULING  MEAN  VARIANCE  OF  MEAN 

POLICY  SOJOURN  TIME        SOJOURN  TIME        THROUGHPUT 

CL.FCFS  71.6  (±.36)  31.6  (±.22)  .127  (±.0007) 

CL,c-^oc  52.8  (±.21)  37.9  (±.21)  .127  (±.0007) 

CL.c  =  0  63.8  (±.24)  23.2  (±.20)  .127  (±.0007) 

TABLE  IV.    Simulation  Results  for  Example  2. 

Simulation  results  for  example  2  are  displayed  in  Table  IV.  As  expected,  the  sojourn 
time  variance  for  the  (CL.c  — >•  oc)  case  is  larger  than  the  (CL.FCFS)  case,  and  there  is 
a  large  diffeence  in  sojourn  time  variance  between  the  two  extreme  cases  of  (CL,c  — >  oo) 
and  (CL,c  =  0).  Thus,  relative  to  the  (CL,c  — »  oo)  case,  a  large  decrease  in  sojourn  time 
variance  can  be  achieved  while  incurring  a  somewhat  moderate  increase  in  mean  sojourn 
time.  Also,  as  in  the  original  example,  example  2  offers  a  policy,  the  (CL,c  =  0)  policy, 
that  simultaneously  reduces  the  mean  and  variance  of  processing  times  relative  to  FCFS. 

Perhaps  the  most  interesting  result  from  this  study  is  that  FCFS  is  not  particularly 
effective  at  minimizing  sojourn  time  variance  in  complex  queueing  systems.  This  paper 
offers  some  aid  in  developing  release  and  sequencing  policies  to  reduce  the  sojourn  time 
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variance  in  such  systems. 
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